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ABSTRACT 

We consider the Gumbel or extreme value statistics describing the distribution function 
PG(^'max) of the maximum values of a random field i' within patches of fixed size. We 
present, for smooth Gaussian random fields in two and three dimensions, an analytical 
estimate of pq which is expected to hold in a regime where local maxima of the field 
are moderately high and weakly clustered. 

When the patch size becomes sufficiently large, the negative of the logarithm of 
the cumulative extreme value distribution is simply equal to the average of the Euler 
Characteristic of the field in the excursion ^ J^max inside the patches. The Gumbel 
statistics therefore represents an interesting alternative probe of the genus as a test of 
non Gaussianity, e.g. in cosmic microwave background temperature maps or in three- 
dimensional galaxy catalogs. It can be approximated, except in the remote positive 
tail, by a negative WeibuU type form, converging slowly to the expected Gumbel 
type form for infinitely large patch size. Convergence is facilitated when large scale 
correlations are weaker. 

We compare the analytic predictions to numerical experiments for the case of a 
scale-free Gaussian field in two dimensions, achieving impressive agreement between 
approximate theory and measurements. We also discuss the generalization of our for- 
malism to non-Gaussian fields. 
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1 INTRODUCTION 

Gumbel or extreme value statistics are concerned with 
the extrema o f samples drawn from random distributions 
l|Gumbe]|[l95i ). In the case of sample means, the Central 
Limit Theorem states that the means of many samples of 
size A'^ drawn from some distribution will be normally dis- 
tributed in the large- A*' limit; analogously, it can be shown 
that in the same limit the cumulative distribution of the 
sample maximum or minimum i/ will tend to one of the 
family of the following functions. 



Go = exp[-exp(-y)]. 



(3) 



with 



(1) 



(2) 



where a an d b are location and scale parameters (see, e.g., 
IColej|200ll ). The shape parameter 7g characterizes the dis- 
tribution: a distribution with 7g = is known as having the 
"Gumbel type" , 
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while 7g < and 70 > correspond respectively to forms 
of the "negative WeibuU type" and the "Frechet type" . 

Distributions given by equation ((TJ have seen ap- 
plication to time- series data in many fields such as cli- 
mate (see, e.g.. iKatz fc Brownl Il992l ). hydrology (see, 
e.g., iKatz. Pa,rlange fc Naveaul booa ). seismology (see, 
e.g., IComelTf 196811. insura nce and finance (see, e.g., 
lEmbrechts fc Schmidlil 1 19941 ). etc, in predicting the inci- 
dence of extreme events from knowledge of past data. Here 
we consider applications to two and three dimensional ran- 
dom fields relevant to cosmology, but our approach is suf- 
ficiently general that extension to other fields should not 
prove difiicult. 

In three dimensions, one is naturally 
ested in the occurence of most massiv e 
ters in galaxy surve ys jBhavsar fc Barro' 
Holz fc Perlmutterl I2OI0I : ICavon. Gordon fc Sil 
Davis, et al. _ 2011 ), of large scale ma ss concentrations 



jive 
owl 

iny 



inter- 
clus- 



1985; 



20101; 



dY amila Yarvura. Baugh fc Anguldl2010l) such as the Sloan 
Great Wall (|Gott. et al.l 1200511 .or of the largest voids 
observed in the spatial distribution of galaxies. In two 
dimensions, the most obvious application concerns the 
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temperature fl uctuations in the Cosmic Mi crowave Back- 
ground (CMB, iMikelson. Silk fc Zunt3l200c ) in p articular 
the analysis of the hottest hot spots (|Colee 1 19881 ) and the 
coldest cold spots. There are several works that suggest the 
exist ence of anomalies in cur r ent CMB experiments (see, 
e.g., iLarson fc WandeitI |2004 lAvaita. Weber fc WetterichI 
l201Cf ). the most proeminent one being the cold spot discov- 
ered in the temperature maps measure d by the Wilkinson 
Microwave Aniso tropy Probe (WMAP, IVielva. et al] 12004 
ICruz. et al.ll2005l '). 

In this work, we consider a random field in two or 
three dimensions, smoothed on some scale I, and take large 
"patches" of size L ^ £. The values of the field at all points 
inside a given patch constitutes one sample, and the ex- 
treme value of the field in the patch our statistic of interest 
(henceforth we restrict ourselves in particular to the max- 
ima, though the case of patch minima is exactly analogous). 

Although some of the results present in this paper also 
apply to the non-Gaussian case, we restrict our practical 
calculations to Gaussian fields of known power spectrum 
and ask how to derive analytically the distribution of patch 
maxima and how to explicitly relate the results to eqs. ^ 
and ([2]). This effort is not new: the calculation of the ex- 
treme value distribution of Gaussian fields has been paid a 
lot of attention by mathematicians, for time-series (see, e.g., 
iLeadbetter. Lindgren fc Rootzenj 1983|) but also in larger 



number of dimensions (see, 
in 



e.g^ Bickcl & Rosenblatt 1973; 

Rosenblattlll976l: lAdleij|l98ll : lAldoua,198a : .Piterbarg.l996i : 



Adler fc Tav'io^l2007l ) . leading to a number of important re- 



sults. In particular, in the very large patch limit, convergence 
to the Gumb el type distribution (f3|) wa s rigorously demon- 
strated (e.g., lBickel fc RosenblattlFgrsI ). Thus, many of the 
results derived in this article can be found in the mathemat- 
ical literature, but we recall them for sake of completeness 
as they are needed to understand how the statistics behave 
in various regimes. 

The method we employ to estimate the extreme value 
statistics relies on a local maxima approach and was al- 
ready utilised in a mo re rigorous mathematical set up (e.g., 
I Adler fc Tavlor"2007l). The central point is the observation 
that the probability of the patch maximum being below 
some specified density threshold is exactly the probability 
of encountering zero points above that threshold. If we also 
identify the highest point in the patch with the field high- 
est peak there (an assumption which is in fact non-trivial in 
general; see section 12.2.11) then the problem is reduced to 
that of finding the void probability for peaks as a function 
of threshold. 

The distribution and clustering statistics of peaks has 
achieved a good deal of attention in the astrophysical litera- 
ture, in part due to their role as the nucleation points of rich 
galactic clusters when the field in question is that of mat- 
ter overdensities. Combining results obtained previously for 
the peak abundances and their correlation functions allows 
us to predict the void probability for both two and three 
dimensional fields, and hence the extreme value statistics. 
Once again, although most of the results can be found in 
the mathematical literature, the key novelty of the present 
paper lies in the derivation and test of approximate predic- 
tions in an intermediate regime where the patch size is large 
enough compared to the coherence length of the field, but 
not so large that either the asymptotic limit expected for 



Gaussian random fi elds l|Bickel fc Rosenblattlll973l '). Q, or 
the Poisson regime (|Aldouslll989f r have been reached. 

This paper is organized as follows. Section [5] sets up the 
general framework, with a few definitions followed by gen- 
eral results. In particular, the Gumbel statistics is related to 
the void probability, which is expressed in terms of average 
number density of peaks above some threshold and their A'^- 
point correlation functions. Section [3] focuses on the Gaus- 
sian field case, where a general estimate taking into account 
full clustering of the peaks is performed and explicit asymp- 
totic formulae are derived and related to equation ([ij . Con- 
vergence to for Gaussian random fields is recovered when 
the patch size tends to infinity. An explicit link to the Eu- 
ler characteristic is also established, in agreement with the 
literature. In section O we test the theoretical predictions 
against numerical experiments in the two-dimensional case. 
Finally, section [5] summarizes the results obtained in this 
article and discusses their generalization to non-Gaussian 
fields. 



2 THEORY 
2.1 Definitions 

We consider, in a _D-dimensional space with D = 2 or 3, a 
random field S{x) of zero average. We suppose that this field 
is statistically stationary (invariance of the A^-point corre- 
lation functions by translation) and isotropic (invariance of 
the A'-point correlation functions by rotation). 



2.1.1 Smoothing window 

This field is smoothed with a window of size £: 

Se = S*Wiix). (4) 

For instance, the Gaussian smoothing window Wi{x) oc 
exp{—x^/2£^), which we shall chose for all practical calcula- 
tions, reads in Fourier space 



lyf(fc) = exp[-(H)V2]. 



(5) 



The top hat smoothing window will be needed as well, on 
scales L ^ £. In 3D, it is a sphere of radius L which reads 
in Fourier space: 

WL{k) = 3[sin(A:L) - kLcos{kL)]/{kLf . (6) 
In 2D, it is a disc of radius L which reads in Fourier space: 
2Ji(fcL) 



WLik) 



kL 



(7) 



where Ji is the Bessel function of the first kind and of first 
order: 



Ji{x) ^ - / cosfj/ - a;sin(2/)ldy. 
^ Jo 



2.1.2 Gumbel statistics 



(8) 



From now on we measure the height of the field using the 
the density contrast in units of its standard deviation. 



(9) 
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with 

2 
'^0 



(Si)- 



(10) 



We consider a large spherical patch of size L at random 
position xq, L ^ £ and measure in that patch the maximum 
value of the smoothed density field: 



: \x — a;o| ^ I/} . 



(11) 



The goal is to study the Gumbel statistics, i.e. the proba- 
bility distribution function PG(i'max)di'max of the values of 
i^max when we choose an infinite number of random realiza- 
tions of xq. This distribution contains a dependence on the 
choice of smoothing and on the size of the patch, 

PG(l^max) =PG(j^max,^, i), (12) 

which we leave implicit in the remainder of the paper. 
2.2 General results 

In a sufficiently non-degenerate field Se , the set of local max- 
ima -the peaks of the density field- is a discrete ensemble 
of points of positions pi and density i/i = Si/ao- We shall 
assume that this property is valid in all the subsequent cal- 
culations. 



2.2.1 Fundamental assumption 

The fundamental assumption we make is that the maximum 
of the density in a patch can be approximated by the density 
at the highest peak contained in the patch: 



\pi — xo\ < L} . 



(13) 



This assumption is valid only in the regime where L ^ £. 
Indeed, there can be local maxima outside the patch but 
sufficiently close to its edge such that the density measured 
at a point on the edge of the patch is larger than the maxi- 
mum density measured in the set of peaks contained in the 
patch. In other words, if we consider the population of local 
maxima of the density field defined inside the D — 1 dimen- 
sional manifold given by the border of the patch, of densities 
Uj, then we have, in realityQ 



max(i/max, i'j) ^ l^n 



(14) 



Obviously, one expects /^max to approach t-max as the ratio 
L/£ increases and the ratio of the patch volume to area near 
its edge decreases. 



2.2.2 General expression of the Gumbel statistics 
Let us define the cumulative Gumbel distribution by 



Pg(«^) = Prob.(!^niax < l^) 



PG(fmax)d!^„ 



(15) 



Such a probability, given the assumptions of § 12.2.11 is also 
the probability that none of the local maxima contained in 
the patch are above the threshold. In other words, if we con- 
sider the population of local maxima satisfying Ui > u, none 



of them belongs to the patch. This happens with a probabil- 
ity Po{v), where Po is the probability of finding no maxima 
with normalized density larger than v inside a spherical cell 
or a disc of radius L: 



hence 

, . dPo 



(16) 



(17) 



The calculation of such a void probability can be performed 
using standard count-in-cell formalism if the number den- 
sity n(yi > v) and the connected A'^-point correlations func- 
tions □ ( xi, ■ ■ ■ , xn ), of local maxima above the threshold , 
are known (IWhite 1 19791 : IfVvI|1985I : iBa ian fc Schaeffeijll989l : 
ISzapudi fc Szalavll 19931 ). 

In particular, one can define the averaged correlations 



over a cell of size L and volume V = {4n/3)L 
V = 7^L^ 



1 

yW 



d x\ ■ 



■ d^xjv(^j^(a;i, • • • ,a;iv). 



the normalized cumulants. 



and 



SI 



SI- 



N, = nVil{L). 



(18) 



(19) 



(20) 



Each of these expressions contains an implicit i^-dependence. 
The number A^c represents the typical number of peaks 
above the threshold per overdense patch in excess to the 
average. It measures the deviation from a pure Poisson dis- 
tribution due to clustering. 

With these definitions, the void probability can be writ- 
ten 



-Po(fmax) = exp [-nVa{Nc)\ , 
with 



OP 

<- ^ X 'C i\N-l^N N-l 



(21) 



(22) 



The challenge is now to relate the statistical properties of the 
local maxima to that of the underlying density field. This is 
made difficult by the fact that the void probability depends 
on the full hierarchy of correlations up to any order: in par- 
ticular one has to relate the A'^-point correlation functions of 
the peaks to the A^-point correlation functions of the under- 
lying density field. We denote the latter by £,n{xi, ■ ■ ■ ,xn), 
and similarly the averaged A^-point correlation functions of 
the density field by (,n{L). 

This exercise has been performed in detail for random 
Gaussi an fields by Bardeen. et ahl (|l986l ) (hereafter BBKS) 
and bv lBond fc Efet athiou ( 1987h (hereafter BE) in the 3D 
a nd the 2D cas es re spectively, extending earlier calculations 
of lKaisej l|l984) and lPolitzer fc Wisd ^M) ■ 

Note that these 



bee lAdler fc Tavloil ||2007| ') for a rigourous formulation corre- The connected A^-point correlation functions are equal to zero 

sponding to a more general patch shape than just a sphere. for a Gaussian field ii N ^ 3. 
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latter did not consider statistics of peaks, but more gen- 
erally of regions of 5i above the density threshold. How- 
ever, in the rare event regime considered here, the two ap- 
proaches should become equivalent (this is discussed in de- 
tail in BBKS). 

The non Gaussian case has been examined as well 
for a quite general c lass of hierarchical models by 
iBernardeau fc Schaeffeil (hereafter BS). The statis- 

tics under consideration in that work was that of overdense 
cells of size I and not of peaks of the density field smoothed 
with a Gaussian window of size I. Again, the approach of 
BS should give the same results as those obtained for peaks 
in the rare event regime. 



2.2.3 Asymptotic expression of a{y) 

A fundamental result of the calculations of BS is that in the 
high peak limit, i.e. v ^ 1, and at large enough separations, 
i.e. 



then, 



« 1, 



, Xn 



trees labels links 



(23) 



(24) 



in the notation of these authors. This expression is valid 
at least in the framework of the minimal hierarchical-tree 
model. The trees refer to ensembles of distinct pair associ- 
ations of elements in the ensemble {1, • ■ • , A'^} such that a 
fully connected structure containing exactly the N elements 
is constructed without any loop. The labels take into ac- 
count all the possible combination of elements in {1, ■ • ■ , N} 
that lead to the same tree topology. In each tree topology, 
there are always N—1 links, by definition. The total number 
of combinations of all the trees and the labels yields N^~^ 
possibilities. Figure 1 of BS can be examined to understand 
the process. For instance, for the 4-point correlation func- 
tion there are 2 tree topologies: (i) the 'star' where one point 
is connected to all the others, and (ii) the 'line' where one 
point is connected to one or two other depending on its po- 
sition (at the end or in the middle). There are 4 possible 
labelings of the star and 12 possible labelings for the line. 

Equation (|24p also applies to Gaussian fields, indepen- 
dently of the shape of the smoothing window, at least if 
v ^ 1 and the following condition, more restrictive than 
(l23l) holds: 



(25) 



^2§2ix^ « 1. 



Indeed, the unconnected part of the TV-point correlation 
function (the moment), fi„, is given by 



n 



+11 



(26) 



in the high threshold regime l|Politzer fc Wisdll984l ). Ex- 
tracting the connected part from this expression consists 
exactly in extracting the ensemble of distinct pairs associ- 
ations in {1, • • • ,N} such that the corresponding topology 
is fully connected. In the large separation limit, i.e. at lead- 
ing order in ^2{xi,Xj), or equivalently if the condition (|25p 
is verified, only the tree topologies remain (because they 



correspond to the minimum power in while being fully 
connected), and each label for each tree is given the same 
weight in equation (|26p. hence leading to equation (|24p in 
that regime. 

Equation (|24p . reads, after volume averaging in a sphere 
of radius L (BS) 



5P(L)~7V^-^ 



(27) 



This result applies as well to the general tree-hierarchical 
model (BS). This means that the function a defined in equa- 
tion ^ reads (BS) 



^(y) = (i + i^) 



= y. 



(28) 



Note that when TVc <C 1, which occurs at some point for a 
large enough value of for which there are very few peaks 
above the threshold in average per patch. 



Nc/2 



(29) 



by definition (ea.\22^. Therefore, even though we expect the 
low end tail of the Gumbel statistics to be affected by the po- 
tential crudeness of our approximation of the function cr(j/), 
the high end tail should still be quite well described. 



3 THE GAUSSIAN FIELD CASE 

Once the function cr(y) is specified, one needs to carry out 
the calculation of the number density of peaks above the 
threshold as well as their averaged two-point correlation 
function. The detailed expressions can be found for a Gaus- 
sian field in BBKS and in BE. 



3.1 Shape parameters of the power spectrum: 7 

and Ri, 

The important parameters that control the number density 
of peaks above threshold f and their two point correlation 
function are the moments 



dk 



-P{k)W^{k)k 



2j 



(30) 



where D = 2 or 3 is the dimension of the space considered. 
Then BBKS and BE define the coherence parameter 7 



(31) 



7 = 

O"ocr2 

and the scale length J?* by 

For a Gaussian smoothing window and a scale free 
power spectrum P{k) given by 



(32) 



P{k) = Ak" 



(33) 



the integrals in equation (|30p can be performed analytically, 
yielding the simple expressions 



2 



(£) 



-(n + D) 



47rO-i 



n + D 
n + D + 2' 



R. 



A ^ ( n + D 



l/(n+D) 



2D 



n + D + 2 



,(34) 
(35) 
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valid for n > -D0 

For a top-hat smoothing the scahng law (|34[) remains 
valid with a diflterent correla tion length, which for the 3D 
case can be written (see, e.g. jLokas. et al1ll996l ) 



9 A r[(n + 3)/2] r[(l-w)/2] 
87r3/2 r[l-n/2] r[(5-n)/2] 



l/{n + 3) 



(36) 



On the other hand, we did not find any simple analytic ex- 
pression of the correlation length for a top-hat smoothing in 
2D. 



3.2 Number density of peaks 

The number density of peaks above the threshold is 



n{u) 



dv'M{v') 



(37) 



This integral is easily performed numerically, using the fact 
that the function Af{v) is given by 



-^e '''''^Gd(7,7//) 



(38) 



(27r)(-D+i)/2i?| 

in D dimensions, with Gs approximated by equation (4.4) 
of BBKS and G2 given by equation (A1.9) of BE. For com- 
pleteness, we rewrite these equations here: 

- 37^w -f {Bw^ + Ci) exp(-Aw^) 



03(7, U) 

with 

A = 

B = 

Ci = 
C2 = 



1 + C'2 exp(-C3w) 



2(9-572)' 
432 



\/T(hf(9- 572)5/2 
1.84 + 1.13(1 -7^)■'^■^^ 



91 -Hl.27exp(6.5l7 ), 
C3 = 2.58exp(1.057^), 
and 

G2(7,w) = 



+ 



+ 



(39) 

(40) 
(41) 

(42) 
(43) 
(44) 




(45) 



3.3 Correlation function of peaks 

In the large separation regime the two point correlation func- 
tion of the peaks (|25|l reads, if one neglects contributions 
from higher order derivatives of ^2{r), which is a fair ap- 
proximation according to BBKS and BE if ^2 is a power-law 
of negative index, 

^ The case D=3 was derived in BB KS, with the expres sion for 
the correlation length £0 given in e.g. iLokas. et af] lll996l') . 



?2 - — 2~^2, 



where 



and the effective threshold I'iy) writes 
70 



(46) 



(47) 



with 6 approximated by equation (6.14) of BBKS in 3D 
^ ^ 3(1 - 7') + (1-216 - 0.97") exp[-7/2(7/./2)2] 



(48) 



(49) 



v/3(l - 72) + 0.45 + (71^/2)2 -I- 71./2 
and given in 2D by 

^2(7,7^^) 

where H{j,w) = dG2/dw is given by equation (A4.7a) of 
BE. For completeness, 



(50) 



H{'y,w) 



+ 



2w s 1 ~ 2'^'^^'" 



4(1-7^)^ 



^2(1- 7^) 
exp 



(3 - 272)^27r(l-72) 
2w 

(3 - 272)3/; 



2(1-72) 



■ exp 



3-272 



1 — 2®''!'^ 



V2(l-7^)(3- 272) 



(51) 



In the large threshold limit, we simply have 

(i>) V ^ 00, (52) 

as derived bv lKaiseH (|l984h . After averaging over volume V , 
the expression of (,2{L) is thus simply given by 



g(L) = (L) 



(53) 



where (,2{L) is the averaged two-point correlation function 
of the underlying density field. It can be derived easily from 
the power spectrum of the underlying (smoothed) density 
field, 5i, using equation (|30p with the top hat window and 
replacing I with L. The largeness of the patch size, L, com- 
pared to the smoothing scale, I, should guarantee that the 
large separation approximation (|25|l is verified in practice. 
This can be checked a posteriori by examining the range of 

values of v wher e PG(t^) is significant. 

Note that iHeavens fc Shetbl l|l999l) and iDesiacgue^ 

l|2008h (the latter in the large separation limit) performed 
the exact calculation of the two-point correlation function 
of peaks in 2D and 3D, respectively, by taking into account 
corrections depending on second and fourth derivative of 
the two-point correlation function of the underlying field. 
These corrections can be significant on large scales, where 
the power-spectrum of the underlying field significantly de- 
viates from a power-law. However, they get progressively 
smaller with increasing threshold v, and in the rare event 
limit in which we are working here, they are probably ir- 
relevant, except perhaps for the low-;^ tail of the Gumbel 
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statistics. Still, this assumption should be checked explicitly 
for the Cold Dark Matter case by assessing the differences 
in the distribution introduced by computing the two-point 
correlation function of peaks with and without them. Such 
an investigation is beyond the scope of this paper, but it 
should be kept in mind when comparing analytic estimates 
of the Gumbel statistics to real data. Also note that in this 
more rigorous context, the simple proportionality relation 
H53p does not apply anymore. 



3.4 Asymptotic regime 

A particularly interesting case corresponds to the regime 
1/ ^ 1 and the Poisson limit Nc <^ 1, where a{Nc) — 1. 
Such a regime is expected to be reache d if L/£ i s sufficiently 
large and has been studied previously (|Aldouslll989. ). How- 
ever, since they will prove to be very useful to understand 
the intermediate (as opposed to asymptotic) regime that we 
discuss later, we recast the main results using our formal- 
ism. Ifere, the calculations will be facilitated by examining 
the cumulative Gumbel distribution, Pa{i^)- 

In the large u regime, the number density of peaks is 
proportional to the Euler Characteristic £ of the underlying 
density fielcQ (e.g., BBKS, BE). Just how large a value of i/ is 
required for this to be true depends on the level of accuracy 
one aims to reach in the description of the function Pg('^)- 
For instance, BBKS suggest 'yf > 2.5 in 3D for a 10 percent 
accuracy on approximating n{i') by the Euler Characteristic. 

With the additional assumption that A'^c ^ 1, equations 
(4.14) of BBKS and equation (3.3) of BE read, respectively 
in 3D and 2D, 



with 



Ud 



exp(— £3!/) 
— exp 



(73(//' 



l)exp ( - — 



exp(— £2^) 



■ exp 



-U2 !^exp 



(27r)(-D+i)/2i?f 
For scale free power-spectra we have 



-i)/2 [ 2D ) [ij 



(54) 



(55) 



(56) 



(57) 



Note that in the original derivation ()AldouJl989 h. the right 
hand part of equation (|54p contains a term scaling like i/^ 
and not — 1. Recall however that these expressions still 
assume v sufficiently large compared to unity. Note also that 
in the ver y large threshold limit, one recove rs the classical 
result fsee lAdlej[l98ll : lAdler fc Tavlojl2007h : 



l-f'G,D(!^> 1) iifuV. 



(58) 



* The Euler characteristic is seen here as an alternate count of 
critical points number densities of various kinds included in the 
excursion in regions with n ormalized density larger than u, see e. g. 
IColombi. Pogosvan fc"Souradccp. 1.2000 '); Adlcr fc TavloJ ll2007t) . 



In the calculations presented in lAdler fc Ta^ l|200'i1) , 
though, the edge effects discussed in § 12.2.11 are not ne- 
glected, i.e. SnV must be in fact viewed as the ensemble 
average of the Euler characteristic of the intersection be- 
tween the excursion and the patch0 

An interesting value of v corresponds to 



(59) 



or Pq — 1/e. Obviously we must have z/* sufficiently large 
compared to unity for equations (|54|) and (|55|) to hold, as 
well as 



(60) 



to remain in the Poisson limit. The last condition imposes 
a constraint on the size L of the patch, which must be large 
enough compared to the smoothing scale £. This obviously 
depends on spectral index: the ratio L/£ should be larger if 
n is small since there is more power on large scale. 
Asymptotically, i^* reads 



I/* ~ y/2 In Ud 



1 + 



(D- l)ln(21n(7i3) 



41nl7o 



(61) 



This equation shows that i^* grows rather slowly with L/£. 

Now we compare the expressions (|54p and (|55p to the 
standard cumulative form ([l]). To determine the parameters 
a, b and jq of eqs. lU and we perform a second order 
Taylor expansion near y = 0, where Gj^{y = 0) = 1/e. 

At second order in 'yay we have 



ln(-lnG,^) 
Similarly we have 

ln(-lnPG,3) ^ 



y 

y-f 7Gy. 



(62) 



-^+ln[[/(a^-l)] 

abja^ - 3) b\a^ + 3) y^ 
' (a2-l)2T' (63) 



ln(-lnPG,2) ==; ^—+\n{Ua) 

Our particular choice of the expansion is convenient since it 
implies 



Then 



b3 = 
62 = 

7G,3 



1 uj-l 



<0, 



(65) 

(66) 
(67) 

(68) 



^ Note that having a very accurate determination of the high v 
tail of the Gumbel distribution has been paid a lot of attention by 
mathematicians and numerous methods have been employed to 
do so, estimating the E uler characteristic being one amongst them 
(see the introduction of lAzai's fc Delmaj|20oj ^ for a panorama on 
various methods). 
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7G,2 — 



<o, 



(69) 



where the labels 2 and 3 refer to the number of dimensions 
considered, D. It is interesting to study the asymptotic val- 
ues of these parameters when InUo 3> 1, hence when P-^ is 
very large: 



7g 



thus 



IGy ~ 1. 



(70) 



(71) 



We can thus understand that the range of validity of the 
Taylor expansion translated in terms of the u variable is 
V £ — e),Vt,{l + e)], where e is a fraction of unity, 

for both Pciy) and G-y(,{u). While Ui, does not in general 
correspond exactly to the position of the maximum of pc iy) 
and 



AG-, 



Av 



(72) 



it is rather close to it, and increasingly so as L/l becomes 
larger. This means in practice that the functions pc (j^) and 
(v) should be well described close to their maximum by 
our second order Taylor expansion in an interval correspond- 
ing to confidence levels up to ~90-95%. Hence, the functions 
Pg{i^) and g^^ {v) should match each other quite well in that 
interval, but not in the tails, especially in the large u one. 

Notice as wel l that 70 is neg ative, as measu red exper- 
imentally by e.g. iMikelson. Silk fc Zuntz (|2009l ) and that 
it converges to zero, so that the function -Pg(i') con- 
verges to eg. Q as demonstrated more rigorously b y e.g. 
iBickel fc Rosenblatt! l| 19731 ) (see also lRosenbiattlll976l ). 

This asymptotic result was first exploited in cosmology 
bvlColcs (1988) to analyze the hottest hot spots in temper- 
ature fiuctuations of the CMB. However, it is important to 
notice that convergence to the form Q is rather slow. On 
the other hand, the form ([TJ with values of a, h and 7g given 
by equations ((53), ((Ml), ([SB, dSH)) and (gSJ along with im- 
plicit equation (|59|) to determine i^* remains always a good 
fit of equations (|54p and (|55|l in the 90-95% confidence re- 
gion, but again, not in the tails. These analytical results will 
be illustrated explicitly in § |4l 



4 MEASUREMENTS 

To check the validity of the theoretical calculations, we per- 
formed numerical experiments in the 2D case. We generated 
scale free random Gaussian fields on sets of 100 realizations 
on a grid of size 4096^ for each value of the spectral index we 
considered, n = Q, —0.5, —1 and —1.5. Smoothing was per- 
formed with a Gaussian window of size 1 = 5 pixels and 400 
non-overlapping circular patches of radius L = 100 pixels 
were extracted from each realization, amounting to a grand 
total of 40000 patches to measure pciv) for each value of n. 

The results are displayed in Figure [1] for n = 0, —0.5 
and in Figure [2] for n = —1.0 and —1.5. Agreement be- 
tween the measurements and theory is spectacular-with the 
best results in the high-i/ region, as expected. Even the case 
n = —1.5 is well described by theoretical predictions de- 
spite the fact that condition (|25p is broken while Ac ^ 1. 
Except for n = — 1, the low-f tail is slightly off the theory. 



which overestimates a bit the measurements for n > — 1 and 
significantly underestimates them for n = —1.5. Still, our 
analytic calculations are sufficiently accurate to define con- 
fidence regions with an accuracy level of a few percent. As 
expected, the effect of clustering between peaks decreases 
with increasing n and becomes rather small for n > — 1, 
given the choice of L/l = 20. In that regime, the number 
density of peaks is in fact well approximated by the Eu- 
ler Characteristics, and the function pg{v) is well fitted by 
a negative WeibuU type form (eq. [ij with the parameters 
derived in § 13.41 except in the high v tail, as expected. In 
agreement with the predictions of i; 13.41 |7g| decreases with 
n and the asymptotic regime ((3)l is approached slowly al- 
though not reached yet, especially in the tails. It was indeed 
argued in § 13.41 that convergence to it is rather slow and 
requires an increasingly large value of L/l as —n becomes 
larger. Note that the data points or the solid curves can also 
be fitted eas ily by the for m (|T| with appropriate choice of 
a, h and 7g (|Mikelson. Silk fc Zunt j|2009l ). except of course 
for the high u tail. For simplicity we did not show that fit 
because it is purely phenomenological and there is no simple 
analytic expression for a b and 7g except in the asymptotic 
regime studied in § 13.41 



5 SUMMARY AND DISCUSSION 

We have computed analytically the Gumbel statistics for 
random Gaussian fields smoothed with a Gaussian window 
of size I. The Gumbel statistics, PG(t'max)dz^max, represents 
the probability distribution of the maximum i^max of the 
field estimated in a patch of size L thrown at random. Our 
important results can be summarized as follows: 

(i) For L sufficiently large in front of I, I'max can be 
approximated by the maximum value of the density esti- 
mated at the positions of the peaks included in the patch. 
As a result, the cumulative Gumbel distribution, Pg{i') ~ 

PG(j^max)di^niax, cau be sccu as the void probability Po 
of finding no peak with density larger than v in the patch. 

(ii) We have made u se of the s t andard counts-in-ce l ls for- 
malism (■Whit^ 1 19791: Ifivl 1 19851 : iBaUan fc Schaefferl Il989l : 
ISzapudi fc Szalavl Il993i r to compute this void probabil- 
ity as a function of the average number of peaks above 
the threshold in the patch, n{i')V, and their correlation 
functions averaged over the patch, ,^j^(L). These quanti- 
ties were themselves calculated using results of the litera- 



1987 ) 



19991 ) 



ture: iBardeen. et al.l (|l986l ) and Bond fc Efstathioul 
to estimate n(i>) and ^?: iBernardeau fc Schaefferl ^^.^^^ 
and lPolitzer fc Wise! (|l984l ) to evaluate higher order correla- 
tions through a hierarchy of normalized cumulants given by 
5*^ = ^^/(CD^"^ = N'^-^. Rigorously speaking, these cal- 
culations are only valid in the large separation limit, ^2^1 
and for 3> 1. They also neglect contributions from higher 
order derivatives of the correlation function of the density 
field, which can in princ iple be taken into account following 
iHeavens fc ShethI (|l99i) and lDesiacaue3 (|2008l ). 

(iii) In the regime f ^ 1 and in the Poisson limit, Ac = 
nV^2 ^ 1) the quantity — In(PG) is simply given by the 
Euler Characteristic of the excursion (Aldousi il989 ). This 
allows one to derive tractable analytical expressions for the 
Gumbel statistics (eqs. 1541 1551 156|l . We have shown that in 
this case Pg{j^) is well fitted by a negative WeibuU type 
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Figure 1. The Gumbel statistics measured in tiie case Ljl = 20 for 2D random Gaussian fields witfi power-spectra P{k) oc k". The 
spectral index, n = and n = —0.5, is indicated on each panel. The values n = —1 and n = —1.5 are examined on Fig. [2] 
First and second row of panels: pcii') and its logarithm as a function of u. The symbols correspond to the measurements in simulated 
data as described in the text. Vertical errorbars show Icr errors calculated from 100 independent realizations of the field. As indicated 
on the top panels, the solid curves correspond to our theoretical prediction (eas. ll6l 1211 1281 1381 1451 l47l 1481 1501 151I I: the short dashed ones 
are the same but assume that peaks are unclustered (Poisson limit, or A^c = 0, equivalently c = 1 in eg. 1211 but still egs. 1381 and 1451 to 
determine the peak abundance); the dotted ones further assume that the number density of peaks in the excursion is approximated by 
the Euler Characteristic (eg. I55t : the dot-dashed curves give the form JTJ fitted on the dotted curves, with the value of 7q obtained from 
matching the Taylor expansion discussed in ii 13.41 (egs. 1591 l65l 1671 169I I: Finally, the last curves (3 dots-one dash repeated) correspond 
to the asymptotic behavior l(3]l expected when the ratio L/£ approaches infinity: they are the same as the dot-dashed curves but with 
7G = 0. 

Third row of panels: Nc (solid curves) and §2 = ^^C2(i)/o'Q (dotted curves) as functions of i^. When Nc <^ 1, one expects the effect of the 

clustering of peaks to become significant. On the other hand, when f=J li o^^r description of the A'^-point correlation functions of peaks 

becomes inaccurate, but this only has a significant impact on the analytical calculation of pgM if ^ 1- Note that the intersection of 

the solid curve and the dotted curve is expected to be in the vicinity of the maximum of PgM- ^ nm r. n a o himti a o r.r.r> mnTTi 

(c) zUiU HAo, MiNxvAb UUU. lilliUI 
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Figure 2. Same as in Fig.[T] but for n = —1 and n = —1.5. 



distribution ([T]) (with 7g < 0), except in the high i/ tail. As 
expected, 7g — >■ when L/£ ^ oo and one converges slowly 
to the Gumbel type distrib ution, eq. ((3|, as shown long ago 
l|Bickel fc Rosenblatt|[l973 ). 

(iv) Our analytical calculations were successfully tested 
against numerical experiments of 2D scale-free Gaussian 
random fields, in particular in a regime where both A^'c <: 1 
and ^2 ^ 1) where the validity of the "exact" calculations 
mentioned in point (ii) remains questionable. 



Note that our calculations can be easily extended 
to non-Gaussian fields, using e.g. the formalism of 
iPogosvan. Gav fc PichonI (|2009l ) to estimate the number 
density of peaks, n( v), and modifications of the Press & 
Schechter formalism (|Press fc Schechterl 1974) to compute 
in the high u regim e (see for instance lPesiacaues fc SeliakI 
'2010'; 'Valagea^ I2OI0I . and references therein) . The hierar- 
chical relation S'^ ~ should still hold if > 1 

(|Bernardeau fc Schaeffei]|l999l ). as extensively discussed in 
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end of § 12.2.21 and in beginning of § 12.2.31 In the Poisson 
limit and for i/ ^ 1, the result obtained in point (iii) above 
should still hold; — In(PG) should be simply given by the 
Euler Characteristic, whic h itself can be easily es timated 
in the non-Gaus sian case ()Pogosvan. Gav fc Pichon 2009 ; 
iMatsubaralbOld '). In fact, one expects that Pq should still 
be well fitted by the f amily of distributions ([T| but with a 
different value of 70 (|Mikelson. Silk fc Zunt j I2OO9I '). How- 
ever, convergence to the asymptotic form ^ in the limit 
L/£ ^ <x remains to be proven. In the intermediary regime 
probed by the Euler Characteristic, the Gumbel statistics 
provides an interesting tes t of non-G aussianity, as shown 
experimentally by Mikclso n. Silk fc Z untz (200_9|) on simu- 
late d temperature ma ps of the CMB. In a companion pa- 
per (|Davis. et al.ll201ll '). we have studied applications of the 
Gumbel statistics to clusters of galaxies, where the quan- 
tity of interest is the probability distribution function of 
the mass of the most massive cluster in the patch (see also 
IHoIz fc Perlmutterl I2OI0I : ICavon. Gordon fc Silkll2010l ). We 
plan to apply 2D Gumbel statistics to the analysis of CMB 
data, including non-Gaussian corrections, in the near future. 
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